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1. INTRODUCTION 

The growth of e-commerce industry has increased the demands of courier services, especially during 
seasonal peaks and sales, further increasing the workload of the already stretch thin workforce of courier 
companies [1]. In times like this, arranging a feasible staff's schedule for a courier distribution centre is 
essential to achieve maximum efficiency to fulfil market’s demands.This problem has been a challenge to 
courier companies as they strive to serve the demands. This paper proposed an evolutionary algorithm to 
produce an optimal staff’s schedule based on one of the international courier companies in Malaysia as a case 
study. 

Due to the arising of e-commerce industry, the logistics industry has a rapid increase in demand to 
accommodate to consumers’ needs. The logistics industry in Malaysia has been given special emphasis in the 
industrial master plan 3 2016-2020 (IMP3), which further pushes the growth of the local logistics industry. In 
a report by DHL Express and IBM, the logistics industry is more than ready to integrate with artificial 
intelligence (AI) technologies as it is now accessible and affordable to a company seeking performance boost 
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[2]. In addition, based on research conducted on courier service quality, one of the main criteria for a good 
courier service is the implementation of information and communication technology (ICT) technologies in 
the service itself [3]. ICT technology can be applied in various activities of logistic industry. For instance, in 
operations planning, insights can be provided to forecast parcel volumes using reliable data [4]. 

In scheduling automation using AI, Job shop scheduling (JSP) is a combinatorial optimization 
problem classified as NP-hard [5]-[9]. JSP involves assigning various jobs, otherwise called processes, with 
varying attributes to a set of machines to be processed or completed. The goal of a standard JSP is to 
minimize the makespan of the whole process [10]. As for the problem of the courier distribution centre, with 
reference to standard JSP, machines, M can be mapped to staffs, S. Whereas Jobs, J can be mapped to flow, 
F. Operations within a job, O can be mapped to processes, P. A flow is complete when all processes had been 
completed by at least one staff. Make span in the context of JSP, can be mapped to the time taken to 
complete the whole operation. There are several hard constraints and soft constraints regarding process 
assignment to staff. Since this problem of staff scheduling for the courier distribution center is an extended 
version of standard JSP, it is also a NP-hard problem. 

In solving a NP-hard problem, metaheuristics techniques are more widely used against other 
techniques. This is due to its capability to find the optimal solution within reasonable time [11]. Some of the 
metaheuristic techniques used in scheduling problems are evolutionary algorithms including genetic 
algorithm, differential evolution, artificial bee colony, ant colony optimization, memetic algorithm and many 
more [8], [12]-[14]. In JSP, genetic algorithm is the most used method [15]. Existing literature on courier 
centre distribution are focusing on routing and delivery problems, and several works that solving the problem 
using metaheuristic techniques are reviewed [16]-[20]. Based on the reviews done, the solution proposed to 
the scheduling problem is highly dependent on the characteristic of the problem itself especially on how the 
problem and constraints are formulated. However, it is proven that meta heuristic technique is a good 
solution for such problem, with a common technique being used is genetic algorithm [21]. 

The remainder of the paper is organised as follows. Section 2 presents the proposed algorithm for 
the problem. Section 3 present the result the algorithm, and finally, Section 5 summarises and concludes the 


paper. 


2. METHOD 
2.1. The problem’s background 

A courier distribution center processes incoming parcels and redistribute the parcels towards their 
destination. The courier distribution center used as a case study handles two operations, inbound, and 
outbound. Inbound operation receives incoming parcels from a flight and distribute the processed incoming 
parcels locally through trucks. Outbound operation receives parcels from courier trucks locally, processes 
them in the reverse order of inbound operation for international distribution. Figure 1 and Figure 2 depicts the 
general inbound and outbound operation respectively. A unit load device (ULD) is a shipment container 
containing all the incoming parcels unloaded from a flight. Each operation has different flow; each handles a 
type of parcel. There are three types of parcels: conveyable (COY), flyer (FLY), and non-conveyable (NCY). 
The term Flyer also refers to two types of parcel DOX and WPX. 

The clearance rate by arrival of parcel at the centre is almost always at 80% of total incoming parcel 
volume. A term for cleared parcels upon arrival at the centre is cleared on arrival (COA). Parcels that is not 
cleared upon arrival are sent to bond cage (TBC) to await instructions. 

In general, there are several flows of process handled at the courier distribution centre. Each of the 
flow corresponds to one type of parcel. If the parcels are further divided to COA or TBC, another flow is 
required. For flyers, this process is entirely managed by conveyor belt, hence requiring no extra flow. To 
summarize, a courier distribution centre generally handles five flows, based on parcel’s type and clearance. 

The goal of the staff's scheduling is to minimize the time to complete an operation. Based on the 
courier distribution centre requirement, an operation must be completed within one hour. Hence, the 
measurement of staff scheduling is parcel per hour (PPH), referring to the number of parcels able to be 
processed by a staff within an hour. A process also has a target PPH, referring to the number of parcels need 
to be processed within an hour by staffs at a particular process point. PPH of staff is observed and recorded 
based on the staffs performance on a process. PPH of processes varies according to total incoming parcel 
volume and parcel type. Table 1 shows breakdown of parcel volume by parcel type. 

There are constraints to be satisfied when assigning staffs to processes. First, all processes must be 
assigned to at least one staff, leaving no process unassigned. Second, a staff cannot be assigned to concurrent 
processes of different flows. Lastly, the PPH value of each process must be satisfied by the combined PPH 
value of all staffs working on a particular process. This is to ensure that one whole operation can be 
completed within the time frame of | hour as required by the courier company. 


Indonesian J Elec Eng & Comp Sci, Vol. 27, No. 2, August 2022: 1043-1050 


Indonesian J Elec Eng & Comp Sci ISSN: 2502-4752 øO 1045 


ULD BROUGHT INTO CENTRE 


+ 


UNLOAD PARCELS FROM ULD (i) 


© @) © 


TRANSFORT SORT PARCELS BY 
PANCETO DESTINATION 
PROCESS POINT 


| PARCEL UNLOAD FROM TRUCK, | 


FLYER/DOX 


COA? 
(CUSTOM 
LEARED?, 


REWEIGH/CHECK 
PARCEL 


© 


REWEIGH/CHECK 
PARCEL 


(CUSTOM 


SEND TO BOND LEARED? 


CAGE (USE 
FORKLIFT) 


ARRANGE PARCEL 


LOAD TO TRUCK 


Figure 1. Inbound operation 
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Figure 2. Outbound operation 


Table 1. Parcel volume by parcel type 


Parcel Type Percentage of total volume Breakdown 
COY 59% - 
FLY 40% - 

- DOX 25% of 40% 

- WPX 75% of 40% 
NCY 1% - 
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2.2. The evolutionary algorithm 

The proposed evolutionary algorithm is a genetic algorithm. The aim is to produce optimal staff 
schedule by minimizing the total processing time and at the same time minimize the number of staff used. 
The schedule produced also have to comply with several constraints outlined in the problem. Algorithm 1 
illustrate the overall flow of a genetic algorithm and the following sections will elaborate in detail each of the 
algorithm’s component. 


Algorithm 1. Proposed genetic algorithm 


T% bestFitness = 0 

2. bestSchedule = 0 

oie create an initial population Population of PopSize individuals, based 
on the initialisation procedure in Section 2.1 

4, while termination condition is not true do 

5; for X € Population do 

6. Calculate X fitness value, F(X), penalised if X has the invalid or 

illogical schedule 

he if F(X) > bestFitness then 

8. Replace bestFitness 

9. bestPlan = X 

0+ end 

Ts end 

Ze Select individuals from the Population based on binary tournament 
selection 

Bia Probabilistically apply the crossover operator to generate new 
individual 

4. Probabilistically select individuals for mutation 

5. Use the new individuals to replace the old individuals in the 
Population 

6. end 

7. Output bestFitness 

8. Output bestSchedule 


2.2.1. Chromosome representation 

Each gene in a chromosome represents a staff selected for scheduling. For instance, gene 1 
represents staff 1, gene 2 represents staff 2 and so on. The gene’s value is a set of processes assigned to the 
staff in a range of O to the highest number of processes in all flows selected. For instance, two flows are 
selected for scheduling, flow 1, and flow 2. Flow 1 has four processes, flow 2 has three processes. Flow 1 has 
the highest number of processes in all flow. Hence, a staff is only possible to have been assigned a maximum 
of 4 processes, since a staff cannot be assigned to concurrent processes. When a staff has been assigned less 
than 4 processes, the gene is filled with 0 to ensure all chromosomes have equal length. The genes are 
separated by the indicator ‘|’. The order in which the processes are assigned matters. They represent the order 
of work for the staff. Figure 3 depicts the chromosome representation. Processes assigned are represented as 
alphabets. 

The gene values are assigned randomly on the condition there is no clashes among processes for a 
staff. After the random initialization, the set of processes will be sorted according to its precedence value. 


Sorting the processes according to their precedence eliminates errors of reversed work flow in the 
chromosome. 


A.B.C.0| F,0.0.0| B.D.0.0| E.G.0.0| D.L.0.0| F.K.L.0| EH.L.M 


Figure 3. Chromosome representation 


2.2.2. Fitness evaluation 

The fitness function is implemented using the normalized weighted additive utility function 
(NWAUF) [22]. There are two objectives that will be minimized: i) the time taken to complete the 
operations, and ii) the number of staff used. These two objectives are calculated separately, normalized, 
multiplied by a weight, and added together in one function. With the weight multiplied with the objectives, 
the importance of each objective can be adjusted easily according to preferences or algorithm performance. 

The first objective, TIME refers to the time taken to complete the operations. It is calculated by sing 
the combined PPH value of all staffs assigned to a process divided by the PPH of a process. This will roduce 
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the time needed to complete the process, in a range of (0, 1), 1 indicating one hour is needed, since number of 
parcel is divided by the parcel per hour of staffs. The resulting value will be in the unit of one hour as well. 

The second objective, STAFF refers to the number of staffs involved in operations in this generated 
schedule or chromosome. It is calculated by counting the number of staffs involved, divided with the total 
number of staffs. With this, the staff objective is measured as the lower the value, the better the solution is. 
Since this is in contrary to the first objective, it is first multiplied by the staff weight, SW, then minus by the 
staff weight. TW refers to time objective’s weight. 


TIME = (1/total time * TW) (1) 
STAFF = (SW- (staffs involved/total staffs * SW)) (2) 
fitness = STAFF + WEIGHT (3) 


Penalties are applied to chromosomes that produce invalid or illogical schedule. Two types of 
penalties are introduced, namely, the clash penalty and invalid penalty. Both of the penalties are calculated 
and then added to the fitness value defined in (3). The clash penalty calculates clashes among the number of 
processes assigned to each staff by their precedence. In other words, if process A with precedence value 1 
and process B with precedence value 1 is both assigned to a staff, one clash is calculated. This calculation is 
accumulated for every gene in a chromosome to find out the total number of clashes. A generated schedule 
with clashes among processes is illogical because a staff cannot be working on two processes with the same 
precedence as these happen concurrently. The number of clashes calculated is divided by the maximum 
number of clashes possible, then, multiplied by a weightage. The result is then minus by the weightage itself. 
This is presented in (4). 


clashes 


) ax Weighteiasnh in) (4) 


illogical penalty = Weight ash penalty — (2 
Where clashes is the number of clashes among assigned processes, whereas maximum is the total possible 
clashes. 

The invalid penalty calculates the number of processes that is not assigned to any staff at all. A 
generated schedule is invalid if there are unassigned processes. Unassigned processes in (5) refers to number 
of unassigned processes, maximum refers to total number of processes. 


unassigned processes 


maximum 


invalid penalty = Weightinyatia penalty — (( ) = Weightinvatia senile) (5) 


Both penalty values are added to the fitness value earlier. A good chromosome will have higher 
fitness value whereas a bad chromosome will have a lower fitness value. To further differentiate among 
invalid and valid chromosomes, a threshold value is applied to the fitness value. If the chromosome has at 
least 1 clashes or unassigned process, the evaluated fitness value will be multiplied by sum of the weightage 
of both of the penalties. With this, the fitness value of chromosomes with clashes or unassigned processes 
will not be higher than the threshold. For instance, a chromosome with fitness value 0.90 is very good in 
terms of time taken to complete whole operations, however, if it has at least 1 clashes, the value is then 
multiplied by the sum of both penalties, say, (0.3+0.3)=0.6. Hence, 0.9*0.6=0.54 as the final fitness value. 
Hence, in this case the threshold is the sum of both penalties, 0.6. Any chromosomes with fitness value 0.6 is 
valid. The formula is as (6). 


fitness = fitness * (invalid penalty + illogical penalty) (6) 


2.2.3. Selection processes and genetic operations 

The strategies for the selection processes and genetic operations are based from existing literature 
with similar work [23]-[25], and then the strategies are modified to suit with the problem. The strategies are 
implemented and tested to choose the final strategy for the evaluation process. For the selection, the 
Tournament Selection will be used to select parents, and survival selection uses elitism technique. Elitism 
technique retains the few top performing chromosomes from the old generation to be brought into the next 
generation. The rests are replaced by the children according to best fitness. For the genetic operations, the 
crossover operation uses two-point crossover and random mutation technique is used in which three genes 
will be mutated randomly by replacing the genes with another value. In this problem, the crossover rate is set 
as 0.9. Mutation rate is set as 0.2. The algorithm stops when it reaches a certain number of generations. 
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3. RESULT AND DISCUSSION 

For the evaluation, the data were provided by the courier center. There are 13 processes, 22 staffs, 
and 286 PPH values. The processes are from 6 different flows handling different parcel types with different 
precedence values. Each staff has varying PPH values for each of the processes. The total number of parcels 
to process is set at 5500 parcels, where it is the standard amount of number of parcels to be processed at a 
courier distribution centre. Each process has different number of parcels to process, which needs to be 
covered by the combined PPH values of the staffs working on this process. The breakdown of the number of 
parcels to process for each process is calculated. The process of handling FLY parcels of clearance COA will 
have 2090 parcels to process. Analysis of the staff scheduling module’s algorithm performance is conducted 
at the system testing stage. Parameter tunings of the algorithm is done to suit with the problem. The final 
parameters values are the ratio of clash penalty in the fitness function to 0.6, decreasing ratio of time to 0.2, 
the population size 70 and maximum generation of 20, the crossover rate is 0.9, and the mutation rate is 0.2. 
Figure 4 depicts the graphical results of a run using these settings. This setting is able to generate a good 
result with a processing time in the range of 30 seconds to 2 minutes. 
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Figure 4. 70 generations with population size 70 


The algorithm performance is also evaluated by the average and standard deviation of several 
indicators: fitness, time taken, number of staffs involved, and clashes. The fitness value ranges from 0-1, the 
higher indicating a better result. Time taken refers to the total time needed to complete the operations using 
the generated schedule, hence, it is the lower the better, with favorable values around 1. Number of staffs 
involved indicates the number of staffs that has been assigned to work in this schedule, which is the lower the 
better. Clashes refers to the number of processes assigned to staffs that clashed with one another in terms of 
processes’ precedence, this indicator is the lower the better. Table 2 shows the results of 10 runs. 


Table 2. Average and standard deviation of algorithm performance 


Runs Fitness Time taken (hour) | Number of staffs Clashes 
1 0.9195 1.3663 22 0 
2. 0.9617 1.2361 22 0 
3 0.9818 1.1 22 0 
4 0.9348 1.4834 22 0 
5 0.9552 1.0384 22 0 
6 1 0.9614 22 0 
7 0.9650 1.211 22 0 
8 0.9681 1.18918 22 0 
9 1 0.9614 22 0 
10 1 0.9796 22 0 
Average 0.9686 1.1527 22 0 
Std Dev 0.0277 0.1781 0 0 
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From the results, the average fitness value of 10 runs is 0.96, indicating the algorithm is able to 
produce high quality results in almost every run. A standard deviation of 0.02 for fitness value indicates that 
the quality of schedules generated is consistent. The time taken indicator has an average value of 1.15, 
indicating the average schedules generated will be slightly more than one hour, however still in the 
satisfactory range. A standard deviation of 0.17 is higher than the standard deviation of fitness value, hence, 
the time taken indicator variates more, either less than one hour or more than one hour. The number of staffs 
maintains the same value throughout all 10 runs in this setting, with an average of 22 and standard deviation 
0, proving 22 staffs is enough to generate a good schedule consistently. The last indicator, clashes, has a 
value of 0 throughout all 10 runs, proving the algorithm is able to generate viable and quality schedule with 
no clashes among assigned processes for each staff. 


4. CONCLUSION 

Based on observation at the courier-distribution centre, there was generally around 33 staffs working 
onsite during an operation. Based on the result, the produced schedule can bring this number down to 22 
staffs, reducing 10 staffs while not affecting operation workflow. The cut down on number of working staffs 
could amount to a substantial reduction of operation cost every month. Besides cost efficiency, the general 
time efficiency of the operations is improved. Schedules generated assign processes to staffs based on parcel 
amount to handle, which enables the flexibility of assigning more or less staff to a process based on demand. 
The processes are also assigned in a manner that maximize a staffs’ capabilities, assigning a staff to another 
process, once a process has completed. Such schedules will take up a big amount of an administrative staff s 
time to construct. To sum up, the proposed evolutionary algorithm is able to generate an optimal staff 
schedule for courier distribution center based on the time taken and total number of staff used. The proposed 
solution can improve the cost, time, and human resource efficiency of courier companies. The algorithm 
could transform staff scheduling in the logistic industry and improve the efficiency of local courier 
companies towards serving their customers better thus increasing their advantages in this increasing 
competitive, and on-demand industry. For future work, the implementation of concurrent execution is to be 
explored since the algorithm execution time took around 60 to 120 seconds to complete due to the large 
number of data. 
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